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Abstract 

We examine the growth, survival, and context of 256 novel hashtags during the 2012 U.S. presidential debates. 
Our analysis reveals the trajectories of hashtag use fall into two distinct classes: "winners" that emerge more quickly 
and are sustained for longer periods of time than other "also-rans" hashtags. We propose a "conversational vibrancy" 
framework to capture dynamics of hashtags based on their topicality, interactivity, diversity, and prominence. Statis- 
tical analyses of the growth and persistence of hashtags reveal novel relationships between features of this framework 
and the relative success of hashtags. Specifically, retweets always contribute to faster hashtag adoption, replies ex- 
tend the life of "winners" while having no effect on "also-rans." This is the first study on the lifecycle of hashtag 
adoption and use in response to purely exogenous shocks. We draw on theories of uses and gratification, organi- 
zational ecology, and language evolution to discuss these findings and their implications for understanding social 
influence and collective action in social media more generally. 



1 Introduction 

The hashtag is a ubiquitous and flexible annotation for 
Twitter users. Hashtags allow users to track ongo- 
ing conversations, signal membership in a community, 
or communicate non-verbal cues like irony. Hashtags 
vary considerably in their activity and audience, but lit- 
tle is known about how the use of novel hashtags co- 
evolve with the needs of their users and the presence 
of other hashtags. Why do some hashtags persist while 
others are just momentary blips? How do similar hash- 
tags compete for attention? 

Hashtags often reflect eccentric topics and their 
emergence is happenstance. These idiosyncrasies typ- 
ically limit researchers' ability to systematically com- 
pare features of their emergence and evolution. How- 
ever, the shared attention to and improvisational nature 
of the U.S. presidential debates provides a unique op- 
portunity to understand the social dynamics of novel 
hashtag adoption following repeated exogenous events. 
Before the October 2012 debates, the importance of 
concepts such as "big bird", "malarky", "binders", or 



"bayonets" to a political campaign would be minimal. 
Yet these unexpected terms now denote salient mo- 
ments from these debates and became rallying calls for 
partisans and punchlines for comedians. More broadly, 
Twitter users created hundreds of hashtags related to 
these topics which had no raison d'etre before the de- 
bates. The candidates' unscripted statements can thus 
be understood as creating an exogenous shock to the 
system of political discourse. Combined with fine- 
grained data about large scale and real time user be- 
havior create, these exogenous shocks create a set of 
natural experiments to systematically analyze the fea- 
tures that contribute to the growth and stabilization of 
hashtags. 

Previous research has examined the roles of "rele- 
vance" and "exposure" for hashtag adoption ifTTl l35l 
[l4j|34l|22j|2Tl. However, these features are inadequate 
for distinguishing hashtag trajectories in the context of 
an event as prominent as the U.S. presidential debates 
where relevance and exposure are effectively "maxed 
out." Furthermore, hashtags are more than labels for 
contextualizing statements, objects for bookmarking, 
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or channels for sharing information, but they are ac- 
tive virtual sites for constructing communities. We pro- 
pose an alternative framework called conversational vi- 
brancy to understand the dynamics of novel hashtag 
use based on how features such as topicality, interactiv- 
ity, diversity, and prominence interact with the commu- 
nities producing and consuming tweets to these hash- 
tags. 

We operationalize our conversational vibrancy 
framework with behavioral and structural variables 
such as retweets, replies, unique tweets, and audience 
size. Analysis of the relationship between these fea- 
tures and the rate and sustainability of hashtag adoption 
reveal the existence of two distinct classes of hashtag 
trajectories. "Winners" are hashtags characterized by 
more rapid growth and longer sustained interest than 
"also-rans." Statistical models of the growth and per- 
sistence of each class of hashtags reveals novel and 
complex interactions with features from our conversa- 
tional vibrancy framework as well as the concentration 
of user attention. Our analysis suggests conversational 
vibrancy contributes to the growth and sustenance of 
hashtags in different phases of their life-cycles. For 
example, while retweets and followers support a hash- 
tag's growth, they also paradoxically undermine its per- 
sistence. We draw on theories of organizational ecol- 
ogy and language evolution to discuss these findings 
and their implications for understanding social influ- 
ence and collective action in social media more gener- 
ally. 

2 Background 

Twitter is a popular "micro-blogging" platform 
founded in 2006 that allows users to receive and broad- 
cast (up to) 140-character messages called "tweets". 
The brevity and immediacy of the medium contributed 
to its rapid adoption as a way of sharing informa- 
tion from mobile devices, propagating information by 
"retweeting", as well as tracking relevant content by 
labeling tweets with "hashtags" ||5]. 

Hashtags first emerged on Twitter during the 2007 
San Diego wildfires as a way to track relevant informa- 
tion about a large scale natural disaster by labeling con- 
tent so that it could be filtered and shared via backchan- 
nels l33l . Unlike traditional tagging systems used for 
information archival, Twitter hashtags can serve either 
as a label for identifying topically relevant streams of 
message or a prompt for commenting and sharing [11]. 



Hashtags often fill a dual role as both a topical iden- 
tifier (e.g., #iPhone) and a symbol of a community 
membership (e.g., #VoteForObama) ll35ll . During the 
"Arab Spring" and other protests, hashtags were used 
by activists to coordinate their actions and garner sup- 
port El. 

Studies of hashtag use have primarily emphasized 
two dimensions: relevance and exposure. The rele- 
vance perspective emphasizes the dependence hashtags 
have on the topics being discussed: content relevant to 
the Twitter community draw more tweets and retweets 
and thus grow sustainably over time 11111 l35l [141 1341 . 
From the view of exposure, however, hashtag's public- 
ity and legitimacy is driven by social exposure as part 
of a process of complex contagion: hashtags grow be- 
cause they are adopted by users with social connections 
to one another, and by users' repeated use and spread- 
ing the word to others of interests ll22l 1211 . 

Analyses of hashtag adoption have characterized dif- 
ferences in the persistence of hashtags, modeled the 
spread of hashtags as a type of complex contagion re- 
quiring multiple exposures, and the behavior of hash- 
tags that emerge in response to exogenous shocks ETl 
1221 . Other studies of hashtag have focused on the tem- 
poral occurrences of the hashtags that are characterized 
by either "peaky" but ephemeral topics versus "persis- 
tent conversations" using less salient terms over longer 
periods of time l26l . In lfl4l . peaks in popular hashtags 
are the result of four discrete classes of activity and are 
primarily driven by exogenous rather than endogenous 
factors. While these analyses have analyzed the use 
of hashtags in the context of specific events, topics, or 
communities, they have not examined the ecology of 
multiple hashtags competing for attention following an 
exogenous event. 

2.1 Conversational vibrancy 

While prior work has emphasized relevance and expo- 
sure as mediators of hashtag activity, these features are 
inadequate for explaining activity during events such as 
elections, sports, and awards shows that attract unusual 
levels of shared attention and intense activity. The 
large audiences who view these popular events predis- 
pose them to generating hashtags that become widely 
adopted as users "dual screen" by tweeting about what 
they watch |[25l 1261 . During these events, exogenous 
shocks can simultaneously expose users to a large num- 
ber of candidate hashtags that are of similar relevance. 
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These constructs are thus inadequate to distinguish be- 
tween hashtags that emerge under these conditions. A 
more interesting question is why are some hashtags re- 
lated to these events adopted while others are not? 

Uses and gratification (U&G) is classic theoretical 
paradigm from mass communication research that ex- 
amines why people become involved in some types of 
mediated communication versus others and what grati- 
fications they receive from it. U&G emphasizes "what 
people do with media rather than what media do to peo- 
ple" which makes it especially salient for studying so- 
cial media like Twitter where users both produce and 
consume content. U&G argues that different forms of 
media compete to provide distinct uses such as news 
or entertainment and audiences actively seek out those 
media that are meaningful to them and can best satisfy 
their individual needs for information, social connec- 
tion, or emotional support Il23ll28ll . 

We draw from U&G theory to argue that users 
choose to employ some hashtags while ignoring oth- 
ers because patterns of use around these hashtags pro- 
vide a better match for their needs. However, we 
also recognize that patterns of communication [ 5 ] and 
perceptions of users' audiences |[T9l interact in com- 
plex ways with other contextual and content-level fea- 
tures l34l l32ll to influence the spread and adoption of 
content in social media fl31 . This suggests the need for 
an integrative framework that synthesizes these mul- 
tilevel processes to understand why users adopt some 
hashtags while rejecting others. We propose conver- 
sational vibrancy as an explanatory framework to un- 
derstand the emergence, growth, and persistence of a 
hashtag and its community of users. Conversational vi- 
brancy consists of four general elements, each of which 
we operationalize with behavioral features specific to 
Twitter: 

• Topicality is a kind of relevance that reflects the 
extent to which tweets labeled by a hashtag are 
timely and have contextual relevance for on-going 
conversations. "#bigbird" was highly topical fol- 
lowing the first debate but was not topical during 
the inauguration because its humor and relevance 
are lost in this different context. The number of 
times a hashtagged tweet created by one user is 
retweeted reflects the value this information has 
for other users. Hashtags are more likely to grow 
and persist when users re-share hashtagged con- 
tent created by others with their own networks. 



• Interactivity is the extent to which individuals at- 
tend to others' statements by responding to them 
in turn. Hashtags can denote spaces for on-going 
interpersonal conversation, supportive interest- 
based communities, or heated debates that bring 
others into the community and get them to adopt 
the hashtags as well. A hashtag may encourage 
people to launch discussions about the topic or 
people may simply reply to a user with a hash- 
tag to acknowledge his or her wit. The number of 
replies co-occurring with a hashtag captures the 
extent to which users are not only paying atten- 
tion to others' content but actively responding to 
one another. Hashtags are more likely to grow 
and persist when there are many users speaking 
directly to each other while invoking the hashtag. 

• Diversity captures whether activity is concen- 
trated in the same information or people are at- 
tending to many different sources of information. 
Hashtags that emerge because of a single "one hit 
wonder" tweet being retweeted over and over are 
qualitatively different from many unique tweets 
using the same hashtag. Because counting the ab- 
solute frequency of tweets observed using a hash- 
tag can be biased towards hashtags being hus- 
tled by spammers, the number of unique tweets 
that have been retweeted at least once provides 
a reasonable threshold for capturing the breadth 
of tweets attracting attention. Hashtags are more 
likely to grow and persist when many unique 
tweets with the hashtag are being shared. 

• Prominence measures the audience exposed to 
a hashtag by a user mentioning it. The audi- 
ence of Twitter users plays a crucial and com- 
plex role in information sharing behaviors [19]. 
Features of users themselves sharing information 
can have greater influence than the content itself 
being shared [34]. Hashtags mentioned by users 
with large audiences of many followers are more 
likely to result in rapid growth and sustained ac- 
tivity than hashtags mentioned by users with small 
audiences. 

In summary, hashtags exhibiting higher levels of con- 
versational vibrancy should gratify more users' needs 
and thus contribute to the success of the hashtag. The 
user-generated and socially-mediated nature of Twit- 
ter will create a positive feedback loop through which 
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hashtags with high conversational vibrancy will prefer- 
entially recruit more attention from users who in turn 
increase the vibrancy of the hashtag while also decreas- 
ing the communicative vibrancy of competing hash- 
tags. We discuss these evolutionary dynamics and eco- 
logical interactions in more detail in the discussion. 

3 Our Approach 

Contemporary presidential campaigns in the United 
States are highly scripted and candidates offer repet- 
itive "stump speeches" that rarely provide new infor- 
mation about their position or strategies ll27l . How- 
ever, debates are highly ceremonial and enthralling 
events that not only attract large audiences of pun- 
dits, partisans, and undecided voters, but also require 
candidates to respond directly to critiques and impro- 
vise responses outside of well-worn campaign talking 
points ll24l . As a result, these debates are ripe for un- 
expected and embarrassing statements. These "gaffes" 
can be impromptu exchanges that make a candidate 
look foolish or outright factual errors that undermine 
a candidate's credibility and often later become fod- 
der for opponents, sometimes substantially changing 
the dynamics of a race Q- Examples of "gaffes" 
from the first presidential debate include Mitt Rom- 
ney's unprompted threat to cut funding for the popular 
PBS children's television show "Sesame Street" with 
its iconic lead character Big Bird as well as Barack 
Obama's widely-derided aloof performance. 

The combination of fine-grained behavioral data 
about a large population of people and high-stakes cir- 
cumstances of Presidential debates provide a unique 
opportunity to analyze how a large and engaged au- 
dience reacts to statements to which it had not pre- 
viously been exposed. Because substantial portions 
of the audience are monitoring or participating in so- 
cial media while simultaneously watching the live de- 
bate [20], these gaffes immediately prompt behavior on 
social media as users try to confirm a statement or alter- 
natively improvise humorous responses to mock it. Un- 
like prior studies of hashtag adoption, threats to validity 
from confounded endogenous processes are minimized 
since gaffes and other unexpected statements made by 
candidates are inherently unpredictable and exogenous 
from our analysis. 

Examples of tweets containing the hashtags from the 
first debate are provided in Table [T] The table shows 
tweets that were posted immediately after or close to 



the onset time of different hashtags, focusing on the 
most popular, politically relevant hashtags or their vari- 
ants. In the first debate, after Mitt Romney mentioned 
"Big Bird," there were several similar hashtags, includ- 
ing #bigbird2012, #bigbird and #savebigbird that ap- 
peared simultaneously in users' tweets (around 21:28 
EDT). The #supportbigbird hashtag was popularized 
15 minutes after it, largely due to intensive retweet- 
ing of a newly-created account under the handle of 
"@BIGBIRD". 

We examine the dynamics of hashtag adoption dur- 
ing and immediately following each of the four de- 
bates during the 2012 presidential campaign. To under- 
stand why some novel hashtags are adopted while oth- 
ers are not, we measure two specific outcomes: the ini- 
tial growth of the hashtag as users adopt it and the per- 
sistence of hashtag use after this initial growth phase. 
Because the behavior we model in this project is highly 
domain-specific to politics, we identify and preferen- 
tially sample from a population of politically active 
users to extract their tweeting history. For this sample 
of users, we examine the emergence of hashtags which 
are "novel" because they had not appeared in users' 
feeds beforehand and identify 256 of the most popular 
and domain-relevant hashtags. Using a variety of infer- 
ential statistical modeling approaches, we examine the 
relationship between the features of the conversational 
vibrancy framework and the growth and persistence of 
these hashtags. 

3.1 Dataset 

While randomly sampling content from Twitter's "gar- 
den hose" and aggregating it to understand collective 
behavior is methodologically defensible under some 
research designs, we argue it is inappropriate for our 
purposes. We identify a relevant sub-population of 
users as follows. Users with relevant characteristics are 
identified first, and tweets for these users are extracted 
for analysis. This focus on the tweets from specified 
users reduces the threat of selection bias where inclu- 
sion in the observed set is correlated with the dependent 
variable ifTTIl . The dependent variables in this study re- 
fer to changes in tweet volumes over time. We thus 
focus on a fixed set of users — those that tweeted heav- 
ily during the debates — across the entire observation 
period. Changes in tweets to the hashtag can thus be 
attributed to choices made by these users. 

User sampling. First, we identified politically- 
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Table 1: Examples of debate-related tweets from the first debate on 4 October 2012 



hashtag 


time (EDT) 


tweet text 


bigbird2012 


21 


28:12 


#bigbird2012 


bigbird 


21 


28:12 


Romney stopping subsidy to PBS telling the host that?! wooo #BigBird 


savebigbird 


21 


28:15 


#savebigbird #debate 


bigbird 


21 


28:18 


How you cut #BigBird though #Romney....Wtf??? Smh 


bigbird 


21 


28:20 


@MittRomney says he will cut funding to PBS even thou he likes Lehrer and #bigbird #debates 


savebigbird 


21 


28:50 


Cut PBS? Noooooooooooi #SaveBigBird 


supportbigbird 


21 


46:37 


RT @B1GB1RD: Yo Mitt Romney, Sesame Street is brought to you today by the letters F U! 








#debates #SupportBigBird 


supportbigbird 


21:51:54 


This entire election is now about who will save Big Bird. #supportbigbird #debates 


supportbigbird 


21:52:45 


#OccupySesameStreet #SupportBigBird we are the 47% 



Table 2: Summary of datasets. All times are Eastern Daylight Time (EDT). 



Debate number 


1 


2 (Vice Presidential) 


3 


4 


Debate starting time 


3 Oct. 21:00 


11 Oct. 21:00 


16 Oct. 21:00 


22 Oct. 21:00 


Tweet volume at peak 


3,268,918 


2,388,963 


3,608,291 


2,415,703 


Unique users at peak 


174,297 


155,739 


181,329 


152,538 


"Novel" hashtags 


92,432 


58,165 


91,705 


77,526 


"Pop" hashtags 


75 


57 


82 


42 


Tracking conclusion time 


7 Oct. 02:00 


15 Oct. 02:00 


20 Oct. 02:00 


26 Oct. 02:00 



(a) DEB 1 (b) DEB 2 



6000 - 




^ 1 1 1 1 1 H — i 1 1 1 1 1 H 

01:00 01:30 02:00 02:30 03:00 03:30 04:00 01:00 01:30 02:00 02:30 03:00 03:30 04:00 

time(UTC) time (UTC) 

Figure 1 : Cumulative tweet volume of top 



active users who tweeted using a hashtag such as "#de- 
bate" or mentioned either candidate's Twitter account] 
during any of the four presidential debates. Using 
Twitter's "garden hose" streaming AP^J users whose 
tweets appeared in the feed were selected into our pop- 
ulation. Second, we extracted these users' tweeting 
histories beginning in mid- August 2012 through late 
October 2012 using Twitter's REST AP^\ Because 
these queries are expensive owing to rate limits, we pri- 
oritized users who tweeted during more of the debates. 
Thus users who tweeted during all four debates are 

1 Obarackobama, @mittromney 
2 https://dev.twitter.com/docs/streaming-apis 
3 https://dev.twitter.com/docs/api/l . 1 



(c) DEB 3 (d) DEB 4 




— I 1 1 1 1 1 H — i 1 1 1 1 1 H 

01:00 01:30 02:00 02:30 03:00 03:30 04:00 01:00 01:30 02:00 02:30 03:00 03:30 04:00 



time {UTC) time (UTC) 

hashtags over time, starting from each debate. 

more likely to be represented in the sample than users 
who tweeted during only one of the debates. While bi- 
ased, this population captures users such as journalists, 
pundits, politicians, and activists who are more polit- 
ically active and thus relevant to our research setting. 
The subset of the resulting corpus used in this analy- 
sis contains 123,560,785 tweets from 2,516, 125 unique 
users posted between September 29 and October 27. 

Descriptive statistics for each of the four debates are 
summarized in Table [2] Each debate occurred between 
21:00 and 22:30 EDT and we tracked the behavior of 
our population of users for 77 hours following each de- 
bate. The tweet volume peaked at the first hour of each 
debate and remained high at the second hour. We refer 
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the two-hour window as "peak window" and identify 
"novel" hashtags that were born in each of the peak 
windows. Figure [T] shows the growth of top five hash- 
tags in terms of the cumulative number of tweets for 
each debate. 

Pre-processing. We extract "novel" hashtags as all 
hashtags that were contained in tweets posted during 
one of the peak windows but did not appear in any 
tweets posted within 96 hours prior to the peak win- 
dows. For each novel hashtag, we extract the users who 
have mentioned the hashtag within the 77 hour post- 
debate window starting from the peak window. We 
then select hashtags that have been mentioned by at 
least 100 unique users to be "pop" hashtags. Despite 
the features of our politically active user population, 
many of these hashtags were not related to the debate or 
politics generally and were removed unless the content 
of the tweet contained terms such as "debate," "pres- 
ident," or any of the four candidates' names. The re- 
sulting corpus contained 256 "pop" hashtags that were 
novel, widely used, and relevant to the debates. The 
numbers of pop hashtags from each debate are reported 
in Table [2] 

The conversational vibrancy framework outlined 
above is operationalized using the following variables. 

1. Number of retweets (rt) under the hashtag is an 
indicator of its topicality. A retweet is an exam- 
ple of an individual identifying a tweet that is so 
interesting or relevant that they wish to re-state it, 
verbatim, to their own followers. Hashtags attract- 
ing more retweets have higher topicality. 

2. Number of replies (rp) to a hashtag is an indica- 
tor of its interactivity. A heated debate may be 
more interactive than a collection of statements 
that agree but do not address one another. 

3. Number of unique retweet sources (src) measures 
the diversity of the conversation. A hashtag with 
more unique retweet sources provides more di- 
verse information. 

4. Expected follower size (follow) is proxy for the 
prominence of the users who tweet to the hash- 
tag. Users with many followers tweet to a hash- 
tag will increase the growth and persistence of the 
hashtag. 



(a) after 6 hours 




— bigbird 

— - supportbig6in 

— ■ savebigtfir) 

— obamaclebalaexcaSSs 
occuaysesamestreet 



01:00 02:00 03:00 04:00 05:00 


06:00 


07:00 


time (UTC) 






(b) after 48 hours 







2 9000 ■ 
> 

I 6000 ■ 
I 3000 - 
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time (UTC) 

Figure 2: Cumulative tweet volume of top hashtags in the 
first debate, over the first 6 and 48 hours. Four out of the 
five hashtags appeared due to Romney's statement regarding 
PBS, and one was related to the debate more generally. 

3.2 Characterizing Hashtag Growth and Per- 
sistence 

Figure [jja) reveals an interesting hashtag dynamics 
during the first presidential debate: while the hashtag 
"#bigbird" was born and rapidly grew at around 21:30 
(1:30 UTC), the hashtag "#supportbigbird" which was 
born 15 minutes after took over in about 10 minutes. 
To see if this is simply the end of the story, we in- 
spect their growth for a longer period. As shown in 
Figure [2ja), after six hours from the start of the de- 
bate, "#supportbigbird" was still on top of "#bigbird." 
However, in a larger time scale as in Figure ^b), we 
discover that "bigbird" won back in the 12 th hour af- 
ter the debate. The differences between the short-term 
and long-term dynamics of hashtags lead to an inter- 
esting question: How can we characterize the complex 
dynamics of these emergent hashtags? 

From the temporal curves it is clear that the dynam- 
ics can not be easily captured by a single process, and 
hence the parametric modeling approach suggested by 
prior work (e.g., 0) is not appropriate. We focus 
on analyzing two specific features of the "pop" hash- 
tags' adoption dynamics. Growth measures the rate of 
change in hashtag use over observation window. A 
hashtag that is mentioned 1000 times in 1 hour has 
higher growth than a hashtag that is mentioned 1000 
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times in 10 hours. Persistence measures the sustained 
activity of hashtag use in new tweets over time. A 
hashtag that continues to be mentioned 70 hours after 
the debate has greater persistence than a hashtag that 
stopped being mentioned after 12 hours. 

Instead of fitting the curves by a parametric model, 
we capture the shape of curves by fitting a spline func- 
tion. Figure [3] show three different patterns for the 
emergent hashtags. A hashtag may grow extremely 
fast and saturate quickly, as shown in Figure [3ja), 
grow slower but also slowly saturate, as shown in Fig- 
ure [3jb), or grow fast and sustain for a longer period, 
as in Figure |3jc). We then quantify the growth of a 
hashtag as the largest slope over its fitted spline func- 
tion. The slope is measured in number of tweets per 
minute (tpm), so a hashtag with a growth 60 indicates 
the hashtag gathered 60 tweets per minute in its fast 
growth phase. 

We identify three critical time points along a hash- 
tag's growth curves: 

• onset time (0): the time where the hashtag was 
first mentioned in our dataset. 

• saturated time (t e ): the time where the hashtag's 
size (in terms of total number of tweets) reach 
99% of its final size in our dataset. We use 99% 
instead of final size to avoid the influence from 
miscellaneous outliers. 

• turning point (t*): the time point where the hash- 
tag growth curve starts deviating from its tangent 
line of the largest slope. This turning point can be 
found by a line search procedure along the tangent 
line. 

The persistence is measured as the duration between 
its onset time and the saturated time t e . In addition, 
the turning point divides a hashtag's life-cycle into an 
initial fast growth phase followed by a relatively slow 
growing phase before reaching its saturated point. 

4 Results 

We use this population of 2.5 million politically- active 
Twitter users and 256 "pop" hashtags to first describe 
general patterns of hashtag growth and persistence. 
Based on behavioral features from the conversational 
vibrancy framework, we use cluster analysis meth- 
ods to identify two distinct "winner" and "also-ran" 
hashtag classes. Integrating these approaches, we de- 
velop statistical models to inductively analyze how fea- 



tures of conversational vibrancy covary for both hash- 
tag classes in the growth and persistence phases of their 
adoption. 

4.1 Categorizing popular hashtags 

Figure |4] shows the 256 hashtags along the three di- 
mensions: growth, persistence, and final size (the total 
number of tweets). We use k-means clustering to iden- 
tify the two distinct classes of hashtags: "winners" (in 
red) grow rapidly and have high levels of persistence 
while "also-rans" (in blue) hashtags have either slower 
growth or less persistence. The "winner" class corre- 
spond to the cluster with relatively large final size. The 
descriptive statistics of the 12 winner hashtags are sum- 
marized in Table [3] 




1000 2000 3000 4000 5000 



Figure 4: Two clas'SWof popular hashtags. "Winners" (in 
red) have high growth and persistence (lifetime) with large 
final size (i.e., total number of tweets). "Also-rans" (in blue) 
are the rest of "pop" hashtags. 

Table 3: Hashtags in the "winner" class 



DEB 


Hashtag 


total 


growth 


persistence 


tweets 


(tpm) 


(mins) 


1 


bigbird 


12667 


45.45 


4405 


1 


obamadebateexcuses 


7617 


24.12 


2999 


1 


supportbigbird 


6289 


61.35 


2493 


1 


savebigbird 


4721 


27.60 


4118 


2 


thingsthatmakebidenlaugh 


9290 


63.10 


1907 


2 


malarkey 


7108 


50.90 


3669 


2 


detailsmatter 


6358 


62.68 


2249 


3 


bindersfullofwomen 


22287 


51.10 


4003 


3 


sketchydeal 


5704 


54.55 


3733 


4 


horsesandbayonets 


8266 


37.27 


2365 


4 


strongerwithobama 


5610 


55.07 


2162 


4 


proudofobama 


5502 


37.28 


2830 



4.2 Explaining Hashtag Growth 

We use time-series regression to discover the relation- 
ships between the hashtag growth and different aspects 
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(a) high growth 
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(b) high persistence 



P" 



teambigbird 



1000 2000 



(c) high growth and high persistence 



& 



bigbird 



3000 4000 



Figure 3: Characterization of hashtag growth and persistence, 
line) over its fitted spline function (in blue dashed line). 

of conversational vibrancy. Our goal is to model the 
minute-by-minute hashtag growth from the onset up to 
the turning point. A time-series regression allows us 
to fit a model of dependent variable (growth) on inde- 
pendent variables (conversational vibrancy) where the 
serially correlated errors are captured by a linear au- 
toregressive moving-average specification. Concretely, 
we consider the dependent variable yt as the number of 
new tweets for a hashtag at time t where t is between 
the hashtag onset time and its turning point t* . 

For each hashtag, we include the following predic- 
tors that reflect its associated conversational vibrancy 
and a control: 

• rtt_i: the number of new retweets at previous 
time t — 1. 

• rp t _ 1 : the number of replies at time t — 1. 

• srcf_ 1 : the number of new retweet sources up to 
time t — 1. 

• follow" x : the expected largest audience size up 
to time t — 1. To compute this, we identify the set 
of users U who mentioned the hashtag up to time 
t — 1, and then compute the expected follower size 
from the users whose follower size is on the top 
\Q th percentile in set U. 

We use the notation (•)" to indicate whether the pre- 
dictor is an aggregate measure aggregating data from 
time to time t—1. To fit the linear model, the variables 
yt, rtt_i, rp t x and follow^ are log-transformed. 
We use a Box- Jenkins autoregressive integrative mov- 
ing average (ARIMA) modeling framework ifTOl to 
evaluate the the autocorrelation function and partial au- 
tocorrelation function of the residuals, and determine 
a model with second-order autoregressive process and 
first-order moving average. The model can be specified 



We capture the growth of a hashtag as the largest slope (in red 



as: 



y t = /3 T Xi_i + e t 

e t = 4>iet-i + + v + ipv t -i 

where x t _i is a vector containing the time-dependent 
predictors, (3 is a vector of parameters to estimate, e is 
the error, 4>i and <p2 are the first- and second-order au- 
tocorrelation parameters, tp is the first-order moving- 
average parameter. The estimated parameters are sum- 
marized in Table |4] The table shows parameter esti- 
mates (standard errors in parentheses) for the regres- 
sion models. 

In the "also-ran" class, hashtag growth has signifi- 
cant and positive associations with rt|_i, rp t-1 and 
followfl 1 . In the "winner" class, the growth is posi- 
tively associated with rt t -i and follow" x , but has a 
weak and negative associations with src"_ 1 . In both 
classes, hashtags whose tweeters have more followers 
(followf^) tend to grow faster. This is consistent with 
the exposure explanation [21]]. Retweets are signifi- 
cant predictors of growth for both winners and also- 
rans. This finding is consistent with arguments from 
organizational ecology [6], which suggest that organi- 
zations and communities with narrow identities that fit 
closely to the environment tend to thrive in the short 
term. Hashtags populated by many retweets relevant 
to the Twitter community at a particular moment ap- 
pear to grow more quickly, as users are drawn in to the 
"hot" conversation. 

The results for replies and growth are not as sim- 
ple. Replies appear to help also-ran hashtags to grow 
quickly, but do not appear to help "winner" hashtags. 
For also-ran hashtags, high interactivity would be as- 
sociated with high growth, as users may be seeking 
to participate in conversations where it appears others 
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Table 4: Growth models 



Variables Winner Also-ran 



rtf_i 0.0626** (0.0239) 0.2651*** (0.0073) 

rp t _ x -0.0128 (0.0161) 0.1642*** (0.0108) 

srcf_! -0.0016*** (0.0002) 0.0000 (0.0000) 

follow^ 0.1048** (0.0375) 0.0891*** (0.0042) 

Loglik -476.54 -26760.69 

AIC 971.07 53539.38 



To capture a hashtag's earlier conversational vi- 
brancy, we focus on activities occurring in its fast 
growth phase. Hence the predictors we choose to ex- 
amine are aggregate measures for each hashtag at the 
turning point t* , including: 

• rtf, : the total number of retweets received up to 
the turning time point t*, i.e., between and t*. 

• rp", : the total number of replies posted up to time 
f*. 

• src°» : the total number of retweet sources up to 
time t*. 

• followf, : the expected largest audience size up to 
time t*. The computation is similar to followf_ 1 
except that the time is fixed at t* . 

The notation (•)** indicates that the predictor is an 
aggregate measure. Table [5] summarizes the results for 
the survival analyses. All coefficient are exponentiated 
and thus interpretable as multiplicative effects on the 
hazard (standard error in parenthesis are not exponen- 
tiated). For the winner class, the predictors rtf* and 
rp"» have significant coefficients. For example, an ad- 
ditional reply reduces the per minute hazard of hash- 
tag saturation by 0.65%. A positive coefficient in- 
creases the value of the hazard function and therefore 
indicates a negative effect on survival time. In other 
words, rp"* has a positive association with a hash- 
tag's persistence and rtf* has a negative association 
with the persistence. The other predictors, srcf* and 
followf, , have no significant effect. For the also-ran 
class, rtf, has negative effect while srcf, has positive 
effect on the hashtag persistence. 

As expected, and consistent with exposure explana- 
tions, the number of followers is not a significant pre- 
dictor of a tag's longevity. Also consistent with the ex- 
planation for growth is the finding for retweets. Orga- 
nizational ecology predict that organizations and com- 
munities with specific identities strongly coupled with 
the environment are risk of demise when the environ- 
ment changes [6]. The very thing that made them ap- 
pealing when the environment was suited to what they 
had to offer now makes them less useful. For exam- 
ple, a hashtag that mainly contains jokes about the Big 
Bird reference may be popular in the initial frenzy af- 
ter the comment. As the debate moves on or ends and 
serious issues come to the fore, the humorous hashtags 
lose their relevance to the new environment. 

Interactivity and diversity present a more complex 



are paying attention to what they say. The fact that 
replies do not appear to supply resources for the growth 
of winning hashtags is surprising, but begins to make 
sense in light of the findings for uniqueness (see be- 
low). 

The results for uniqueness (src°_ 1 ) also show a dis- 
tinction between the two classes. "Winner" hashtags 
appear to be constrained by diversity, whereas "also- 
ran"tags are not. Together with the results for replies, 
this suggests that these classes are distinguished by dif- 
ferent conversational vibrancy. "Winner" hashtags ap- 
pear to be those that gain strictly from their relevance 
to the environment, i.e., they have interesting tweets. 
When fitness is high, diversity becomes a drag [ 12]. By 
contrast, "also-ran" tags appear to be less relevant. At 
the same time, what is important for growth is the abil- 
ity to bring something beyond relevance — interaction 
with others. 

4.3 Explaining Hashtag Persistence 

To examine the effect of earlier conversational vibrancy 
on a hashtag's persistence, we use survival analyses 
based on the Cox proportional-hazards model [8]. We 
examine the survival time of a hashtag starting from 
the turning point i*to the saturated time t e . Let T be 
a continuous random variable, the survival function is 
defined as: S(t) = Pr(T > t), the probability that the 
hashtag will survive (i.e., not saturated) beyond t. 

A hazard function assesses the instantaneous risk of 
demise at time t, conditional on survival to that time. 

, M ,. Pr[(t <T<t + At)} 

h(t) = hmAt-^o — - 



At 

In the Cox regression model, the log hazard can be 
specified as: 

h(t) = h (t)exp(p T x) 

where x is a vector of predictors, (3 is a vector of pa- 
rameters to estimate, ho(t) is the baseline hazard. 
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(a) Winner (b) Also-ran 




minutes ^minutes 

Figure 5: Estimated survival function S(t) for the two 
classes. Half of the hashtags in the winner class lived be- 
yond 2.5 days, and half of the hashtags in the also-ran class 
died out (become saturated) in two hours. The dash lines 
show a point- wise 95% confidence envelope around the sur- 
vival function. 



Table 5: Persistence models 



Variables 


Winner 


Also-ran 


rt* 


1.001* (0.0003) 


1.001* (0.0003) 




0.9935* (0.0003) 


1.000 (0.0002) 


src"» 


1.001 (0.0014) 


0.9933*** (0.0019) 


follow", 


1.000(0.0000) 


1.000 (0.0000) 


Loglik 


-15.92968 


-1028.731 


AIC 


39.85936 


2065.461 



set of findings. More replies increase the longevity of 
"winner" hashtags while having no effect for "also- 
rans." Simultaneously, diversity extends the life of 
also-rans but has no effect for winners. This is addi- 
tional evidence of a qualitative difference between the 
two classes of hashtags. Replies appear to buffer win- 
ner tags from the inevitable decline in their relevance 
over time O. That is, "winner" hashtags with many 
replies may able to fall back into the niches that were 
originally occupied by the "also-rans" — a place to ex- 
change ideas about a topic, rather than just to broad- 
cast one's opinions or humorous insights. The fact 
that "also-rans" benefit from diversity, but "winners" 
do not, suggests that some "also-rans" may build up a 
sufficient community that wishes to continue the con- 
versation even as new topics emerge and relevance de- 
clines. Fig. [5] shows the estimated survival function 
S(t) for the two classes. In the winner class, half of the 
hashtags lived beyond 2.5 days. In the also-ran class, 
half of the hashtags died out (become saturated) in two 
hours. 



Table 6: Growth models with co-occurring hashtags 





WinnPT 

T V 1 1 11 IV. 1 


A l^o-ran 

l \ 1 . 1 V ^ 1 till 


rt t _i 


0.0596* (0.0240) 


0.2817*** (0.0073) 


r P f -i 


-0.0141 (0.0161) 


r\ i c z' f \ -t- -i- -t- / r\ r\-\ r\r\\ 

0.1560*** (0.0109) 


src"_ 1 


-0 001 f,*** (0 0007 s ! 


001 V*** (0 000^"i 


follow"^ 


0.0989** (0.0380) 


0.0811*** (0.0042) 


rtEnv t i 


-0.0028 (0.0271) 


0.0530*** (0.0043) 


rpEnv t _! 


0.0113 (0.0174) 


0.0173*** (0.0038) 


srcEnv^Lj 


0.0014 (0.0016) 


0.0000 (0.0000) 


Loglik 


-475.89 


-26793.36 


AIC 


975.79 


53610.72 



4.4 Examining Environmental Context 

In our analysis, all the "pop" hashtags emerged from a 
unusual condition where users' attention was concen- 
trated on debate events. It is possible that a hashtag's 
growth and death is largely determined by this partic- 
ular environmental condition rather than the conversa- 
tional vibrancy of the hashtag itself. To test the ro- 
bustness of our findings against these concerns, we re- 
construct the prior growth and persistence models and 
include variables for users' tweeting activity for other 
hashtags at the same moment. 

We extend the growth models to include three time- 
dependent environmental covariates. For each hashtag 
i, we have: 

• rtEnv t _i: the total number of new retweets that 
do not contain hashtag i, at previous time t — 1. 

• rpEnv t _ 1 : the number of replies that do not con- 
tain hashtag i, at time t — 1. 

• BrcEnv^_]_: the number of new retweet sources 
that do not contain hashtag i, up to time t — 1. 

Based on the time-series regression described above, 
we include the three new predictors into new mod- 
els of growth. The estimated results are summarized 
in Table [6] Interestingly, none of the environmen- 
tal covariates has significant effect on the growth of 
winner hashtags. However, the predictors rtEnv^i 
and rpEnv t _ 1 are positively correlated to the growth 
of "also-ran' hashtags. While these effect are not as 
strong as rt$_i and rp t _ l5 the results suggest that 
"also-ran" hashtags benefit from other tweeting activi- 
ties. 

To study the relationship between hashtag's persis- 
tence and the environment, we include three new pre- 
dictors in the survival analyses. Unlike the original 



10 



Table 7: Persistence models with co-occurring hashtags 



Van nhl f**; 

V Cll luUl^J 


Wi n n f*r 

T V 1 1 11 It. 1 


A 1 tin-ran 

1 \ 1 .1 V 7 1 till 


rt t «. 


1.001 (0.0004) 


1.001** (0.0003) 




r\ r\r\f\ a -> /r\ r\r\ a c \ 

0.9904* (0.0045) 


1.000 (0.0002) 




1 00? (0 00? 1 


QQ?8*** (0 001 Q"! 


follow?, 


1.000 (0.0000) 


1.000 (0.0000) 


rtEnv t 


1.428 (2.688) 


0.7173* (0.1502) 


rpEnv t 


1.548 (2.288) 


0.6976* (0.1709) 


srcEnv" 


2.023 (1.560) 


1.020 (0.01799) 


Loglik 


-11.65331 


-1023.476 


AIC 


37.30663 


2060.952 



four predictors in the persistence models which were 
measured at a fixed point in time (t*), the new predic- 
tors are measured at each time t, where t is between the 
turning point t* and the saturated time t e . The time- 
dependent covariates measures "what was going on in 
the environment" before the hashtag's saturation. For 
each hashtag i, we have: 

• rtEnv t : the total number of new retweets that do 
not contain hashtag i, at time t. 

• rpEnv t : the number of replies that do not contain 
hashtag i, at time t. 

• srcEnvf : the number of new retweet sources that 
do not contain hashtag % up to time t. 

These three new time-dependent covariates, together 
with the four fixed point covariates, are included in 
the new persistence models. The estimated results are 
summarized in Table [7] Like the growth model, we 
see no impact from the environment on this model for 
predicting winner hashtags' persistence. For also-ran 
hashtags, the rtEnv t and rpEnv t increase the hazard 
of becoming saturated and they are negatively associ- 
ated with persistence. The reflects the fact that many 
of the also-ran hashtags died out in a couple of hours 
where there were still high tweeting activities going on 
in the environment. 

5 Discussion 

User-generated content with high levels of conversa- 
tional vibrancy should be a focal point for attracting 
the attention and participation of users. These ele- 
ments should satisfy a confluence of users' needs by 
providing topical information, interactive sociality, di- 
verse and novel content, and involvement from well- 
connected users and lead to a feedback loop that rein- 



forces these tendencies. Operationalizing this model 
for Twitter in particular, we investigated how these 
features of conversational vitality influenced both the 
growth and persistence of adoption for different classes 
of hashtags. 

Summary of findings. The number of times a hash- 
tag is retweeted (topicality) as well as the popularity 
of the users mentioning the hashtag (prominence) lead 
to more rapid growth of hashtags for "winner" and 
"also-ran" hashtags alike. Additional replies (interac- 
tivity) and the number unique retweet sources (diver- 
sity) support the persistence of "winner" and "also-ran" 
hashtags, respectively. In addition, our findings unex- 
pectedly suggest that the number retweets inhibit the 
growth of hashtags. Because these findings remain ro- 
bust after controlling for other contextual behavior of 
users, these findings provide mixed evidence for the 
simple cumulative vibrancy model we proposed. In 
particular, the fact that additional incremental activity 
inhibits the growth and persistence of hashtags sug- 
gests there higher-order processes that lead to limits or 
tipping points. 

One direction for extending this conversational vi- 
brancy model would be to draw upon theories from 
organizational ecology that describe the birth, growth, 
and death of organizations and communities. Ecolo- 
gists treat organizations as entities which, like hash- 
tags, coordinate behavior through identities that sug- 
gest the kinds of behaviors and messages that are ap- 
propriate within organizations of a particular form 0. 
In particular, ecological research focuses on the im- 
portance of balancing consistency and coherence with 
flexibility and access to resources. The environment 
provides a limited supply of resources for organizations 
(laborers, customers, etc.) analogous to how the Twit- 
ter environment provides a limited supply of resources 
for hashtags (attention, users, ideas to express, etc.). 
Organizations survive and thrive when their identities 
are specific enough such that individuals know what to 
expect from them but broad enough that they can ad- 
dress a range of needs and access a variety of resources. 
It has been recognized that organizations with similar 
identities tend to thrive as their population grows, with 
each gaining more attention and legitimacy. At some 
point a limit is reached and the environment can no 
longer support most of these organizations, and only 
a few remain 0. 

A substantial portion of the research on growth 



11 



and sustainability involves the comparison of cul- 
tural forms, such as words, names, memes, or net- 
works. Both exposure and "fitness" play important 
roles in these models, suggesting rationales for both 
self-reinforcing growth and saturation OH III- Stud- 
ies of baby naming conventions have found that bursti- 
ness is associated with short life-time because collec- 
tive aversion to "faddishness" limits the growth of a 
name [4]. These findings are corroborated by other 
work on memes that suggest social media have car- 
rying capacities lfl6ll . Our model introduces an addi- 
tional factor to consider: the endogenous and devel- 
oping qualities of the communication and interactions 
between those that use the form, serving to weaken the 
determinism suggested by fitness and first-mover based 
explanations. 

The application of this model to linguistic evolution 
may be particularly appealing. The ways in which lin- 
guistic conventions are shared by a groups have been 
investigated in the field of language dynamics lfl"8l . 
Computational models have shown that simple linguis- 
tic interactions can lead the group to reach a consen- 
sus on a given linguistic convention by the progres- 
sive elimination of competing synonyms |[30l |2l. In- 
terestingly, these models assume that the competing 
synonyms are perfectly equivalent. While the situa- 
tion is clearly more complex with hashtags, our re- 
search demonstrates that the observed winner-takes-all 
dynamics are not necessarily driven only by intrinsic 
properties of hashtags but also by other contextual and 
structural features which suggest new avenues for re- 
search across fields like communication, social com- 
puting, linguistics, and political science. 

Our approach in this analysis also has limitations 
that merit future research. We do not examine more 
platform-specific contextual features such as the ef- 
fect of a hashtag's length (number of characters) on 
its life-cycle, which has significance due to Twitter's 
140-character limit for a message [35]. Despite the 
highly situational and socially-embedded interactions 
in social emdia, our modeling approach assumed the 
observation of hashtags on users are the result of an in- 
dependent process. Future work could specify alterna- 
tive models that account for the local clustering of ties 
or the modular structure of these communities which 
likely strongly influence the growth and persistence of 
hashtag use. 



6 Conclusion 

We introduced a theoretical framework called con- 
versational vibrancy that synthesizes work from so- 
cial computing and mass communication to understand 
why some forms of user-generated content are adopted 
while others are abandoned. The U.S. presidential de- 
bates allowed us to understand the lifecycle of hashtag 
adoption and use in response to exogenous shocks that 
allow us to separate out the confounding influence of 
topical relevance and social exposure. Using simple 
computational means to characterize the key aspects of 
hashtags's complex dynamics, statistical models reveal 
qualitative differences in how conversational vibrancy 
influences the growth and persistence of distinct classes 
of hashtags. The study also demonstrates the power of 
computational social science lfT3l approaches to meld 
data-driven computational and statistical approaches to 
investigate fundamental questions about human social- 
ity. 
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